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Abstract. Let V : M. d — > A be a query problem over M. d for which there exists a data 
structure S that can compute V(q) in O(logra) time for any query point q 6 M. d . Let D 
be a probability measure over M. d representing a distribution of queries. We describe a 
data structure T = Tpjj, called the odds-on tree, of size 0(n e ) that can be used as a filter 
that quickly computes V(q) for some query values in M. d and relies on S for the remaining 
queries. With an odds-on tree, the expected query time for a point drawn according to D 
is 0(H* + 1), where H* is a lower-bound on the expected cost of any linear decision tree 
that solves V. 

Odds-on trees have a number of applications, including distribution-sensitive data 
structures for point location in 2-d, point-in-polytope testing in d dimensions, ray shoot- 
ing in simple polygons, ray shooting in polytopes, nearest-neighbour queries in M. d , point- 
location in arrangements of hyperplanes in M. d , and many other geometric searching prob- 
lems that can be solved in the linear-decision tree model. A standard lifting technique 
extends these results to algebraic decision trees of constant degree. A slightly different 
version of odds-on trees yields similar results for orthogonal searching problems that can be 
solved in the comparison tree model. 



1 Introduction 

Geometric search problems have a long and rich history [2, 26, [36]. In these problems, we are 
typically given a collection S of n geometric objects in M. d and asked to store them in a data 
structure so that we can efficiently answer queries about these objects. Until quite recently, 
the performance of these data structures was typically measured, at least theoretically, in 
terms of the worst-case (over all possible choices of S and q) query time as a function of n. 
A worst-case query time of O(logn) is typically viewed as the gold standard for such search 
problems, at least in models of computation that only allow binary decisions. 

Somewhat more recently, researchers have begun studying geometric search prob- 
lems under the lens of distribution- sensitivity. In this setting, one assumes that there is a 
probability measure D over the set of possible queries, and one attempts to optimize the 
expected query time when queries are distributed according to D. For example, given a 
planar triangulation G and a distribution D over M. 2 , one can construct an 0(n) sized data 
structure that can determine the face of G that contains any query point q £ M 2 . The 
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expected query time of this structure is 0(H + 1), where 

ff = J>]og(l/pO (1) 

i 

and pi denotes the probability that q is contained in the ith face of G [H El \7\ 13 [28J 
129] . Information theory tells us that this result is optimal in any dichotomous model of 
computation where any execution branching has at most 2 possible outcomes. 

More recently, Collette et al. \\.7\ [T8] have shown that a similar result holds for 
point location in any simple connected planar subdivision G. Note that this involves more 
than simply triangulating G and applying the results for triangulations. Triangulating G 
increases the number of faces and therefore also increases the value of H in Q. On the 
other hand, we can not expect to always achieve a query time of 0{H + 1) since this would 
imply, for example, an 0(1) query-time data structure for testing if a point is contained in 
a simple polygon. The result of Collette et al. is a data structure whose expected query 
time is 0(H* + 1) where H* is a lower-bound on the expected cost of any linear decision 
tree for point location in G with queries distributed according to D. 

Dujmovic et al. [23] study distribution-sensitive 2-sided 2-dimensional orthogonal 
range counting. They describe a data structure, called a biased range tree, that preprocesses 
an n point set S C M 2 and a query distribution D over M 2 . A biased range tree uses 
0(n log n) preprocessing time and space and can answer 2-sided 2-dimensional orthogonal 
range counting queries over S in expected time 0(H* + 1), where H* is a lower-bound 
on the expected cost of any comparison tree for 2-sided range queries over S with queries 
distributed according to D. 

The results of Collette et al. \17\ 118] and Dujmovic et al. [23] use similar techniques 
to prove their optimality, but each requires their own ad hoc arguments. For example, the 
point location results are achieved by finding near-minimum-entropy Steiner triangulations 
and then applying existing distribution-sensitive results for point location in triangulations. 
Biased range trees on the other hand, mix k-d trees, fractional cascading, and biased binary 
search trees to achieve their running time. 

In the current paper, we describe a general and low-overhead method of taking any 
O(logn) query-time data structure and making it distribution-sensitive. Let V : M rf — > A 
be a query problem over M. d for which there exists a data structure S that can compute 
V(q) in O(logra) time for any query point q € R d {^] Let D be a probability measure over 
M. d representing a distribution of queries. We describe a data structure T = T-pjj, called 
the odds-on tree^of size 0{n e ) that can be used as a filter that quickly computes V{q) for 
some query values in M. d and relies on S for the remaining queries. With an odds-on tree, 
the expected query time for a point drawn according to D is 0(H* + 1), where H* is a 
lower-bound on the expected cost of any linear decision tree that solves V . 

For any constant integer p > 1, a standard lifting technique allows us to lift queries 
from R d into R d ' , with <f = ( p+d ) - 1 so that there is a correspondence between d'-variate 

1 There is no natural definition of n for the abstract problem V . Nevertheless, all the problems we will 
eventually consider arise from a set of n objects in R d and, for all these problems O(logn) is the optimal 
worst-case query time in the models of computation we will consider. 

2 odds-on • adj. having a better than even chance of success "the odds-on favourite" 
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linear inequalities (halfspaces in W 1 ) and <i-variate polynomial inequalities of maximum 
degree p [23]. This technique yields the same result, except that H* becomes a lower-bound 
on the expected cost of any degree p algebraic decision tree that solves V . 

Odds-on trees have a plethora of applications, including distribution-sensitive data 
structures for point location in 2-d, point-in-polytope testing in d dimensions, ray shoot- 
ing in simple polygons, ray shooting in polytopes, nearest-neighbour queries in M. d , point- 
location in arrangements of hyperplanes in M. d , and many other geometric searching prob- 
lems that can be solved in the linear-decision tree model. Furthermore, a variant of the 
odds-on tree that works in the comparison tree model provides distribution-sensitive data 
structures for orthogonal searching problems in the comparison tree model. 

The remainder of this paper is organized as follows: Section [2] presents some pre- 
liminary definitions and background material. Section [3] presents the odds-on tree and 
algorithms for constructing it. Section [4] proves that the odds-on tree matches the query 
time of any linear decision tree. Section [5] presents some of the geometric applications of 
this data structure. Finally, Section [6] summarizes and concludes with directions for future 
work. 

2 Preliminaries 

Throughout this paper, the underlying dimension, d, is a constant, and other constants 
defined in the paper may (implicitly) depend on d. A simplex in R d is the common inter- 
section of a set of at most d + 1 closed halfspaces in M. d . Note that, under this definition, 
simplices need not be bounded and M. d , as well as 0, are both simplices. 

Throughout this paper, we assume an underlying probability measure D over M. d . All 
expectations and probabilities are (implicitly) with respect to D. For any subset X C M. d , 
Pv(X) refers to D(X). We use the notation D\ x to denote the distribution D conditioned 
on X, i.e., D [X (Y) = Pr(Y | X) = Pr(Iny)/Pr(I) for all Y C R d . If F= {X u ...,X k } 
are non-overlapping subsets of M d then the entropy of F, denoted H(F) is 

k 

H{F) = ^Pr(X i |UF)log(l/Pr(X i |UF)) , 

i=l 

where for any set S, US denotes Uses s - The probability measure D is used as an input to 
our algorithms. We assume that the algorithm has access to D through a Sampling Oracle 
that allows us to draw a random sample g£l <i from the distribution D. 

A query problem over ~R. d is a function V : M. d — > A where A is some set of answers. 
We assume the existence of two oracles that allow access to V . The Backup Oracle allows us 
to compute V(q) for any query point q, but requires O(logre) time to do so. The Interference 
Oracle allows us to test, for any simplex A, if there exists p, q £ A with V(p) ^ F{q). The 
running time of the Interference Oracle will be unspecified. 

The oracles are used in the following ways: The Backup Oracle is used to answer 
any queries that can not be answered directly by the odds-on tree. The Sampling Oracle 
and the Interference Oracle are used only during the construction of the odds-on tree. The 
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running times in theorems in Sections [3j and [4] all specify the number of invocations of the 
Sampling and Interference Oracle used. When discussing applications in Section [5l efficient 
implementations of the Interference Oracle for specific problems will be described. 

A decision tree for V is a rooted ordered binary tree in which each internal node 
v is labelled with a function vj : M rf — > {0, 1} and each leaf w is labelled with an element 
wa G A. A query point q G M. d follows a root-to-leaf path, proceeding to the right child of 
v if Vf(q) = 1 and the left child of v if Vf(q) = 0. For a decision tree T and a point q G R rf , 
we denote by T{q) the label of the leaf on the root-to-leaf path for p in T. A decision tree 
solves V if T(q) = V(q) for all q G M. d . The depth of a node v in a rooted tree T, denoted 
deptbj>(t;), is the number of edges on the path from v to the root of T. The (expected) cost 
of a decision tree, denoted //£>(T), is the expected depth of the leaf reached when p is drawn 
according to the probability measure D. 

Decision trees are classified based on the types of functions, vj, used at their nodes. 
In a linear decision tree, each Vf is a linear inequality. In a degree p algebraic decision tree, 
each Vf is a d-variate polynomial inequality of degree p. In a comparison tree, vj is a simple 
comparison that compares one coordinate of a query point g to some value. 

Entropy, query problems, and decision trees are all related by (half of) Shannon's 
Source Coding Theorem [42] : 

Theorem 1 (Shannon 1948). Let V : R d -> A be a query problem and define V (o) = 
{q G R d : V(q) = a}, for any a G A. Then, for any decision tree T that solves V , 

H D {T) > H{{P~\a) :aeA}) . 

3 The Data Structure 

In this section we describe a data structure T-p^ called the odds-on tree that, in conjunction 
with a Backup Oracle, yields a data structure that solves V and has expected query time 
that is within a constant factor of the expected cost of any linear decision tree T* for V. 
For a reader familiar with Matousek's efficient partition trees [33], the executive summary 
of this section is as follows: An odds-on tree is essentially a partition tree on a sample of 
0(n e ) points drawn according to the distribution D. A complete description follows. 

Theorem 2 (Matousek 1992). There exists a constant c such that, for any set S ofm points 
in M. d and any constant r < m, there exists a sequence (Ai, . . . , A r ) of closed simplices such 
that U- =1 Ai = R d , 

1. | A* n S\ < 2m/r, where A* = A; \ (ijpi Aj) , and 

2. For any hyperplane t, there are at most cr l ~ l l d elements of {Ai, . . . , A r } whose in- 
teriors intersect I. 

The sequence of simplices Ai, . . . , A r can be computed in 0(m) time. 
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Note that Condition 1 of Theorem [2] is not in the original statement of the theorem, 
but follows from Matousek's incremental construction of Ai, . . . , A r |33j. 

Let S be a set of m points in M. d . Then the partition tree Tg for S is a rooted 
ordered tree obtained by recursively applying Theorem [2] The root of Tg has r children 
corresponding to the simplices Ai, . . . , A r obtained by applying Theorem [2] to S. The ith 
child of the root is itself the root of the partition tree Tg n A* for S PI A*. This recursive 
process stops when the set S contains at most 1 point or when the depth exceeds some 
pre-specified maximum depth k. 

Next we define some regions, A(u), 0(f), and E(w), that are associated with each 
node v of Tg. Every node v in Tg, except the root of Tg, is naturally associated with a 
simplex A(v) that was obtained from Theorem [2] and that generated v. For the root of Tg, 
we define A(v) = R d . 

For a node v of T whose ancestors are v\, . . . ,Vi we define 0(f) = A(v ) np|* =1 A(v»). 
Note that 0(f) C A(f), and that 0(f) is a convex polytope that has 0(((d+l)(i+l))L d / 2 J) = 
0(iL rf / 2 J) vertices since it is the intersection of at most (d + + 1) halfspaces [34] . 

For a point q G R d , the search path for q in Tg starts at the root and proceeds to 
the first child i such that q G A, (note that this implies q G A*) and this process is applied 
recursively until reaching a leaf of Tg. In this way, for every node v of the partition tree 
there is a maximal subset H(w) C M rf such that the search path for every point q G H(t») 
contains u. Note that E(«) C 0(t>) C A(v), but that H(u) is not necessarily convex or even 
connected. 

We extend the definitions of S, 0, and A to sets of nodes in T in the natural way; 
if V is a set of nodes in T, then H(V) = {H(u) : u G V}, 0(F) = {0(f) : v £ V}, and 
A(V) = {A(«) : v G V}. 

The following theorem summarizes the properties of the partition tree Tg [33J (each 
property is inherited from the corresponding property of the simplicial partition in Theo- 
rem]^: 

Theorem 3. Let S be a set of m points in R rf , let Tg denote the partition tree described 
above, and let Vi denote the set of at most r % nodes o/Tg at depth i. There exists a constant 
c, independent of r and m, such that the partition tree Tg has the following properties, for 
every i G Z: 

1. For every node v G Vi, \S D < m(2/r) 1 , and 

For any hyperplane £, the number of elements in O(V^) whose interiors intersect £ is 
at most (cr 1_1 / d ) J . 

The partition tree Tg can be constructed from S in 0(m log m) time. 

The following is a sampling version of Theorem [3] that we will use in the construction 
of an odds-on tree: 

Theorem 4. Let S be a sample of m points in M. d i.i.d. according to D, let Trj = Tg denote 
the partition tree given by Theorem^ and let Vi denote the set of at most r % nodes of To at 
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depth i. There exists a constant c, independent of r, m and D, such that with probability at 

1 /2 

least 1 — 0(e~ m ), Tp has the following properties, for every i G {0, . . . , |_(1 / 4) log r mj }: 

1. For every node v 6 Vi, Pr(E(v)) < (3/r)*, and 

2. For any hyperplane I, the number of elements in 0(Vi) whose interiors intersect i is 
at most (cr 1-1 / d )\ 

The sample partition tree Tq can be constructed in O(mlogm) time plus the cost of 0(m) 
calls to the Sampling Oracle. 

Proof. Condition 2 follows with certainty from Theorem [3j In this proof we bound the 
probability of failure for Condition 1. Let k = [(1/4) log r mj . 

We will use D m (A) to denote the empirical measure of a set A C WL d : 

def \snA\ 



D m (A) 



m 



From Theorem [3] we have 



Now, 



Pr sup D(H(u)) > 



su V D m {E{v)) < (2/r) 1 . 



= Pr ^U, eyi 



D(E(v))-D m (E(v))> 
D(E(v))-D m (E(v))> 



D m (E(v)) 



< Pr sup (D(E{v))- D m (E(v)))>r- 

< Pr (sup (D(A) - D m {A)) > r~A 

\AeA J 



where A are sets formed by taking the intersection of k closed simplices and subtracting 
k{r — 1) (possibly empty) simplices from this intersection. This is because E{V) C A. We 
can actually handle all levels i £ {0, . . . , k} of the tree at once with the inequality 



Pr U 



ie{i,...,k} 



sup D(E(v)) > 



< Pr sup (D{A) - D m (A)) > r~ 



The class A for k = 1 and r = 1 is the class of all simplices in W 1 . Since a simplex in M. d is 
the common intersection of d + 1 halfspaces in it helps to first consider halfspaces. The 
set of halfspaces in M. d has Vapnik-Chervonenkis dimension d+ 1 (this follows from Radon's 
Theorem [24J). By Sauer's lemma [40j [20, pages 28-29], the number of subsets of an m-point 
set that can be obtained by intersections with halfspaces does not exceed (m + l) d+l . A 
simple combinatorial argument then implies that the number of subsets of an m-point set 
that can be obtained by intersections with simplices does not exceed (m + l)( d+1 ) . 



6 



Assume now general r and k. Then the number of subsets of an m-point set that can 
be obtained by intersections with sets from A does not exceed (m + iy k ( d + l ) ; by the same 
combinatorial argument. By a version of the Vapnik-Chervonenkis inequality [43J shown by 
Devroye [19] . 



Pr sup \D(A) - D m (A)\ >t) < 4 (m 2 + l) 



rk(d+l) 2 



for allt > 0. Thus, 



sup D(E(v)) > 



since k = (1/4) log r m. 



exp [At + At 2 - 2mt 2 ) 



< 4 (m 2 + i) rk ^ 2 exp (4r" fc + 4r" 2fc - 2mr" 2fc 



< exp [2 + 4r~ fc + 4r~ 2fc + rk(d + l) 2 ln(m 2 + 1) - 2mr 2k 

< exp (lO + r(d + l) 2 log r (m) ln(m 2 + 1) - 2m 1/2 



O e 



,1/2 



□ 



Note that, so far, we have not considered the query problem V at all; the sample 
partition tree To of Theorem [4] is defined completely in terms of the probability measure D. 
The odds-on tree T-p,D for (V, D) is obtained in the following way: We start by constructing 
a sample partition tree Tjj as described in Theorem [4] using the value m = n T for some 
parameter r > and setting the maximum depth to k = [(1/4) log r m\ . 

Next, we trim some nodes of T-p.D- We use the Interference Oracle to test, for each 
node v of Td, if V(p) = V(q) for all pairs of points p,q £ 0(v). If so, we remove all the 
subtrees rooted at the children of v , we call v a terminal leaf, and we label v with the label 
£(v) = V(q). Note that the Interference Oracle works for simplices, but 0(f) is a polytope. 
Thus, this test requires decomposing 0(v) into simplices and using the Interference Oracle 
on each simplex. Using the bottom vertex triangulation [15] for this decomposition allows 
us to decompose 0(v) into 0((depth T (t>))l- £i / 2 J) = log ^ n simplices in 0(\og ^ n) time. 
This yields the following lemma: 

Lemma 1. For any n > and any t > 0, an odds-on tree of size m = n T , with maximum 
depth k = [(1/4) log r /3 mj , and having the properties of Theorem^ can be constructed in 
0{n T log ^ n) time plus the cost of 0(n T ) calls to the Sampling Oracle and 0(n T log *- 1 ) n) 
calls to the Interference Oracle. 



Using an odds-on tree to answer a query q G M. d , is easy: We follow the search path 
for q in T-p^ until we either reach a terminal leaf v , in which case we output £(v ), or we reach 
a non-terminal leaf w after O(logn) steps, in which case we rely on the Backup Oracle to 
report V(v) in O(logn) time. The correctness of this procedure follows immediately from 
the definition of terminal and non-terminal nodes. In the next section, we analyze the 
performance of odds-on trees. 
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4 Analysis 



In this section, our goal is to lower-bound the expected cost of any linear decision tree that 
solves V in terms of the odds-on tree Tp t jj. We accomplish this by decomposing the nodes 
of T-p t D into subsets with some helpful combinatorial properties. 

An i-set of a rooted tree T is a set of vertices in T all of which are at depth at 
most i and in which no vertex in the set is the ancestor of any other vertex in the set. We 
say that a set of regions F = {X\, . . . , Xt}, Xi C M. d , is in k-general position if there is no 
hyperplane that intersects k or more elements of F. 

Lemma 2. Let T-p d be the odds-on tree defined in Section^ let V be an i-set ofT-p p, 
and let k > 1 be a constant. Then V contains a subset V' C V such that the elements 
of 0(V) are pairwise disjoint and in k-general position and \ V'\ = £l{\V\/r t ( d l k+1 ~ 1 / d+ ^), 
where 5 > is a decreasing function of r. 

Proof. We will first use the probabilistic method [3j to establish the existence of a (not 
necessarily disjoint) set V" satisfying the size and fc-general position requirements and then 
show that V" contains a large subset V' whose elements are also pairwise disjoint. 

Let V" be a Bernoulli sample of V where each element is selected independently 
with probability p = r -*{d/k+l-i/d+S) . "W e Wli i prove that 

Pr{0(V") is in fc-general position and \V"\ = Cl(p\V\)} > . 

Consider any hyperplane £. Condition 2 of Theorem [4] implies that I intersects the interior 
of at most {cr l ~ l / d ) % elements of V for some constant c. The probability that I intersects 
the interior of k or more elements of V" is therefore no more than 

((cr)^V d y\ pk ^ {cr y(k-k/d) p k 

For each node v G V, 0(v) has 0(i L d / 2 J ) vertices, and the number of nodes in V is at 
most r l . Therefore, the elements of 0(V) define a test set L of 0((zL d / 2 Jr l ) d ) = Oii d2 l 2 r d% ) 
hyperplanes such that 0(V") is in A:-general position if and only if no hyperplane in L 
intersects k or more elements of 0(V"). The probability that any hyperplane in L intersects 
k or more elements of 0(V") is therefore at most 

0(i d2/2 r di ) ■ (cr) l{k - k l d ^p k = 0{r^ d+k - k ' d+s ^)p k = 0{r 5 ~ k ^) = o(l) 

for any 5 > {k — k/d)\og r c. The above argument shows that the nodes in V" are quite 
likely to be in /c-general position. To see that V" is sufficiently large, we simply observe 
that \V"\ is a binomal(|V|,p) random variable and therefore has median value at least 
Lp|F|J = n(\V\/r i ( d / k+1 - 1 / d+ V). Therefore, 

Pr {0(V") is in fc-general position and \ V"\ = n(\V\/r i W k+1 ~ 1 / d+s ^) } > l-(o(l)+l/2) > 

This establishes the existence of a sufficiently large set V" such that 0(V") is in fc-general 
position. 
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Finally, we select V C V" so that the elements of O(V') are pairwise disjoint. To 
do this, imagine sweeping a hyperplane £(t) = {(xq, . . . , xj) G M d : xq = t} from t = — oo 
to +00. Associate with each element v G V, the maximal interval [a v , b v ] such that 0(v) 
intersects £(t) for all t G [a„, This yields a set SV" of rea l intervals such that no point is 
contained in k or more elements of Sy>. Dilworth's Theorem |21| implies that Sy contains 
a subset of size at least \V"\/k of non-overlapping intervals. This subset corresponds to 
a subset V C V" with |V'| > |V"|/A; and such that the elements of O(V') are pairwise 
disjoint. The set V' satisfies all the conditions of the lemma. □ 

We are now ready to show that the expected search time in the odds-on tree is a 
lower bound on the expected cost of any linear decision tree that solves V . 

Lemma 3. Let Tp t jj be the odds-on tree defined in Section^ and assume Tp t D satisfies 
Conditions 1 and 2 of Theorem^ Let L denote the set of leaves ofT-p^, and let T* be any 
linear decision tree that solves V . Then 

fi D (T*) = Q(H(E(L)) - 1) . 

Proof. This proof mixes the ideas from the proofs of Lemma 3 by Dujmovic et al. |23| and 
Lemma 4 by Collette et al. [TTj. For an i-set V of nodes in Tp t £>, we define the shorthands 
Pr(V) = Pr(UH(V)) and H(V) = H(E(V)). Let T be the tree obtained from T V>D by 
removing all terminal leaves, and let V denote the set of leaves of T' . Note that 

H(L') = H(L) - O(logr) = H(L) - O(l) 

since each leaf in L' has at most r = O(l) children in L. 

We first partition L' into groups Gi, G2, ■ ■ ., where Gi contains all leaves v such that 
1/2*- 1 > Pr(H(u)) > 1/2*. Note that Condition 1 of Theorem [| implies that the depth of 
a node in Gi is at most i/log(r/3). 

We then further partition each group Gi into subgroups G^i, . . . , G^. For each 
j G {0, . . . , Gj^-i}, \Gij\ = Q(2 ai ), for some constant a > and the elements of 0(Gij) 
are pairwise disjoint and in /c-general position. Furthermore, the final subgroup, Gj ^ has 
size at most 0(2 /3 *), for some constant /3 < 1. 

This partitioning is accomplished by repeatedly applying Lemma [2] to remove a 
subset Gij C Gi that is in A;-general position and has size Q(2 ai ), stopping the process once 
the size of Gj drops below 0(2 lBl ). This works provided that we choose /3, k, and r so that f3 > 
((bgr)/(log(r/3)))((d/fe+l-l/d+<y) and set a < p-(Qogr)/Qog(r/3)))(d/k+l-l/d+8). 
For example, by choosing r and k to be sufficiently large, /3 = 1 — l/(3d) and a = I /(3d). 

Consider the linear decision tree T* that solves V . The leaves of T* partition M. d 
into cells whose closures are convex polytopes. For a leaf w of T* we denote its polytope 
by 0(w). If the depth of w is i, then 0(w) is the intersection of at most i halfspaces. This 
implies that 0(w) intersects at most ik elements of O(Gjj) since, otherwise, 0(u>) contains 
0(v) for some non-terminal node v G Gij. (This would contradict the assumption that T* 
solves V '.) 
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Let w be some leaf of T* such that 0(w) intersects t elements of 0(Gij). Then, 
by the discussion in the previous paragraph, depth T *(u;) > \t/k~\. We can easily create a 
subtree of w whose height is at most t and with the property that 0(w') intersects at 
most one element of O(Crjj) for each leaf w' of T^,J^] If we do this for every leaf w of T* we 
obtain a tree T*- such that every leaf of T*- intersects at most one element of 0(Gij). 

Let Dij = -D|uh(g, : j) denote the distribution D conditioned on UH(Gij). Note that 
the leaves of T*- could be relabeled so that they indicate which element of O(Gjj) (if any) 
that they intersect. By Shannon's Theorem (Theorem [TJ , this implies that 

{k + 1) • MD^Cr*) > IHKJT- ,) > H(G id ) . 

It follows |18} Lemma 3] that 

(k + 1) ■ fi D (T*) > H(L') - H({uE(G ld ) :t€N, j G {1, . . . , U}) - O(l) . 

Thus, all that remains is to upper-bound the contribution of H = i?({US(Gjj) : i G N, j G 
{1, . . . , ij}), as follows: 

= H({uG itj :ieN,j e{l,...,U}) 

OO ti 

= EE Pr ( G M)l°g(l/Pr(G M )) 

i=i j=i 

OO I ti — 1 

= E E Pr ( G 'M)log(l/Pr(G M )+Pr(G Mi )log(l/Pr(G M J) 
i=i \i=i 

OO / ti — 1 

< £ j] PrC^j) log(2*- ai ) + i2^- i+1 + O(l) 
i=i \i=i 

< (1 - a)H{L') + O(l) . 

Thus, we have 

(k + l)fi D (T*) > H(L')-H-0(1) 

> aH(L') - 0(1) 

> aH(L)-0(l) 

= n(H(L) - 1) , 

so fi£)(T*) = Q(H(L) — 1), as required. □ 

Remark: By more carefully handling the constants in the proof of Lemma |3j and allowing 
k and r to be arbitrarily large, one can obtain the tighter lower-bound 

H D {T*) + log^Cr*)) >»H- 0(log(fcr)) 
where a can be made arbitrarily close to 1/d by increasing k and r. 



a Fot example, making the root of correspond to the bisector of two polyhedra of O(Gij) that intersect 
0(ui) means that each of the children of the root intersect at most t—1 elements of 0(Gij). Applying this 
recursively yields a subtree of height at most t — A slightly more involved argument can produce a tree 
T* of depth O(logt). 
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Theorem 5. Let V : M rf — > A be a decision problem for which we have a (Oilogn) time) 
Backup Oracle and an Interference Oracle, and let D be any probability measure over M rf 
for which we have a Sampling Oracle. Then, for any constant e > 0, an odds-on tree of size 
0(n e ) can be constructed in 0(n e ) time plus the cost of 0(n e ) calls to the Sampling Oracle 
and 0(n € ) calls to the Interference Oracle. 

This odds-on tree can, in conjunction with the Backup Oracle, compute "P(g) for 
any q E M. d drawn according to D in 0(H* + 1) expected time, where H* < ^^(T*) for any 
linear decision tree T* that solves V . 

Proof. Applying Lemma [T] with r < e yields the stated bounds on the construction time 
and the use of the Sampling and Interference Oracles. 

If T-p.D satisfies Conditions 1 and 2 of Theorem [4] then, by Lemma [3j the expected 
time to answer queries using T-p,D is 

^Pr(i)0(depth T (i)) =£)Pr(t)0(log(l/Pr(t))) = 0(H(L)) . 

Otherwise, the expected time to answer queries using Tp t jj is O(logn). Therefore, the 
expected time (where the expectation is taken over D and the random sampling used to 
generate S in Theorem [4]) to answer a query using an odds-on tree is 

(1 - 0(e~ nT/2 )) ■ 0(H(L)) + 0(e- nT/2 ) ■ O(logn) = 0{H(L) + 1) . 

On the other hand, by Lemma [3] the expected cost of any linear decision tree T* that solves 
V is 

fi D (T*) = n(H(L) - 1) , 
which completes the proof. □ 

Using a standard lifting of query points [44J, any degree p algebraic decision tree 
that solves a problem V : M. d — > A can be implemented as a linear decision tree in M d , with 
d' = — 1- Applying this yields the following corollary: 

Corollary 1. Let V : M. d — > A be a decision problem for which we have an (0(logn) time) 
Backup Oracle and an Interference Oracle, and let D be any probability measure over M rf for 
which we have a Sampling Oracle. Then, for any constants e > and p > 1, using 0{n e ) 
calls to the Sampling Oracle and 0(n e ) calls to the Interference Oracle, an odds-on tree 
can be constructed in 0{n e ) time and 0(n e ) space. This odds-on tree can, in conjunction 
with the backup oracle, answer V -queries drawn according to D in 0{H* + 1) expected time, 
where H* < /X£>(T*) for any degree p algebraic decision tree T* that solves V . 

5 Applications 

In this section, we discuss a few of the many potential applications of odds-on trees. In all 
of our applications, the problem V is a query problem over some set of n geometric objects 
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in M. d . In order to shorten the statements of the theorems in this section, we say that a 
data structure for a problem V is distribution-sensitive (in the linear decision tree model) if 
the expected query time of the data structure is 0(/j,d(T*) + 1) for any linear decision tree 
T* that solves V. Before delving into the details of the applications, we first outline some 
general strategies for implementing the Interference Oracle needed to build an odds-on tree. 

An odds-on tree can be made to have size 0(n e ) for any constant e > 0. Furthermore, 
the number of calls to the Interference Oracle made during the construction of an odds- 
on tree is 0(n e ). In almost all of our applications, the Interference Oracle can be trivially 
implemented to run in O(n) time by testing the query simplex A against each input element. 
This means that, even with no preprocessing, the contribution of calls to the Interference 
Oracle to the construction time of an odds-on tree is no more than 0(n 1+e ). Some of the 
subsequent theorems in this paper will use this fact implicitly. 

In other cases, the Interference Oracle corresponds to a natural query for which 
there exists (or we can develop) an 0{n l ~ e ) query-time O(nlogn) preprocessing-time data 
structure. In these cases, the time to construct the odds-on tree becomes 0(n log n). One 
particularly common instance of this occurs when Interference Oracle queries can be reduced 
to 0(1) simplex range counting queries in W 1 for some constant dimension a". In this case, 
Matousek's partition trees [33] yield an Interference Oracle that can be constructed in 
0(n log n) time and that can answer queries in 0(n 1 ~ 1 l d+e ) time. 

5.1 Point Location Problems 

The planar point location problem is to determine which face of a planar straight line graph 
G contains a query point q 6 M 2 . A number of authors have considered distribution- 
sensitive algorithms for this problem [U EJ E] EJ EJ H3 EHl EH] and have mostly solved it. 
The most general such result is due to Collette et al. [17J and gives a distribution-sensitive 
data structure for point location in connected planar subdivisions. This leaves open the 
case where the graph, G, of the subdivision is not connected. Since there are several 
O(logn) query-time, 0(n log n) preprocessing-time data structures for planar point location 
in (possibly disconnected) planar subdivision [U [251 E01 E3 [39] we can apply odds-on trees. 

To obtain a fast preprocessing time, we can use an implementation of the Interference 
Oracle based on partition trees. For this problem, the Interference Oracle must answer 
queries of the form: "Does the interior of query triangle A intersect more than one face of 
G?" We can build a data structure for this problem by first removing from G any edges 
and vertices not on the boundary of more than 1 face to obtain a graph G'. The interior of 
A intersects more than one face of G if and only if the interior of A intersects an edge of 
G'. 

If A intersects an edge of G' then A contains a vertex of G' or some edge of A 
intersects some edge of G' . Determining if A contains a vertex of G' is the classic 2- 
dimensional simplex-range counting problem that can be solved in 0(n 1 / 2+e ) time after 
0(n log n) preprocessing using partition trees. To determine if some edge uw of A intersects 
some edge xy of G' , we observe that uw and xy intersect if and only if 

L(u, w, y) A L(x, y, u) A L(w, u, x) A L(y, x, w) 
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or 

R(u, w, y) A R(x, y, u) A R(w, u, x) A R(y, x, w) 

where L(a,b,c) and R(a,b,c), are predicates that are true if and only if the sequence of 
points abc form a left turn, respectively, a right turn. If we fix x and y, then each of 
the above predicates is the sign of a linear function over the variables U\, 112, w%, and W2 ■ 
Therefore, we can treat each edge of G' as a point in M 4 , yielding a point set Sg> C M 4 
such that testing if edge uw intersects some edge of G' can be reduced to two simplex range 
counting queries over Sc- Again, partition trees allow us to do this in 0(n 3//4+e ) time 
after 0{n log n) preprocessing. Therefore, the Interference Oracle for point location can be 
implemented to run in 0(n 3/,4+e ) time after 0(n log n) preprocessing. This yields our first 
theorem: 

Theorem 6. There exists a distribution-sensitive data structure for the planar point location 
problem that uses 0(n log n) preprocessing time and 0(n) space. 

The 3-d point in polytope problem is the problem of determining if a query point 
g G I 3 is contained in a 3-dimensional polytope P. The Dobkin-Kirkpatrick hierarchy 
|22| gives an O(logn) query-time, 0(n) preprocessing-time data structure for this problem. 
Furthermore, Interference Oracle queries (which involve testing if a tetrahedron intersects 
the boundary of P) can also be answered in O(logn) time by the Dobkin-Kirkpatrick 
hierarchy. 

Theorem 7. There exists a distribution-sensitive data structure for the 3-dimensional point 
in polytope problem that uses 0(n) preprocessing time and space. 

These results can be extended to higher dimensions, though with more space |15j : 

Theorem 8. There exists a distribution-sensitive data structure for the d- dimensional point 
in polytope problem that uses 0(n^- d ^ 2 ^ +e ) preprocessing time and space. 

The problem of point-location in an arrangement of hyperplanes is the problem of 
determining which cell in an arrangement of n hyperplanes in M. d contains a query point q £ 
M d . Liu gives an 0{n d ) space and preprocessing-time, O(logn) query-time data structure 
for this problem |31j . 

Theorem 9. There exists a distribution-sensitive data structure for point location in an 
arrangement of hyperplanes that uses 0(n d ) preprocessing time and space. 

5.2 Post-Office Queries 

For an n point set S C M d , the d-dimensional post-office problem asks for a point of p G S 
that minimizes the Euclidean distance \\pq\\ for a query point q G M^. 

The post-office problem can be solved efficiently through the use of point location 
in Voronoi diagrams. In 2-dimensions, Voronoi diagrams are planar graphs of size 0{n) 
and can be computed in 0(n log n) time J38] . Combining this with Theorem [6] gives a 
distribution-sensitive data structure for the 2-d post-office problem: 
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Theorem 10. There exists a distribution-sensitive data structure for the 2- dimensional 
post office problem that uses 0(n log n) preprocessing time and 0(n) space. 

In d > 2 dimensions, the post-office problem can still be solved using Voronoi dia- 
grams, but the space and preprocessing costs are higher [16J: 

Theorem 11. There exists a distribution-sensitive data structure for the 2-dimensional 
post office problem that uses 0(n^ d ^ 2 ^ +t ) preprocessing time and space. 

5.3 Ray Shooting 

The ray shooting in a polygon problem asks for the first point on the boundary of a polygon 
P intersected by a query ray. A query ray can be represented as a pair of points in M 2 (the 
source and any other point on the ray), so this is a query problem over IR 4 . Several 0(n) 
preprocessing time O(logn) query time solutions to this problem exist \12\ 127]. 

Theorem 12. There exists a distribution-sensitive data structure for the ray shooting in a 
polygon problem that uses 0(n 1+e ) preprocessing time and 0{n) space. 

The ray shooting in a d-dimensional polytope problem asks for the first point on the 
boundary of a convex polytope P C ~M. d intersected by a query ray. For polytopes in M 3 , the 
Dobkin-Kirkpatrick hierarchy |22j can perform ray shooting in O(logn) time per query. 

Theorem 13. There exists a distribution- sensitive data structure for the ray shooting in a 
3- dimensional polytope problem that uses 0(n 1+e ) preprocessing time and 0(n) space. 

Schwarzkopf |41j gives an 0(nL rf / 2 J +<E ) space, O(logn) query time solution for the 
problem of ray shooting in a d-dimensional polytope with d > 4. 

Theorem 14. There exists a distribution-sensitive data structure for the ray shooting in a 
d-dimensional polytope problem that uses 0(nL rf / 2 J +e ) preprocessing time and space. 

5.4 Orthogonal Range Counting 

The 2-d orthogonal range counting problem is the problem of counting the number of points 
of a data set Scl 2 that are contained in a query rectangle [q%, qj[ x [q^, q±\. (Note that this 
produces a query point q G M 4 .) Bentley's range trees [10], with fractional cascading [13 [32], 
yield an 0{n log n) preprocessing time and space, O(logn) query time data structure for this 
problem. Interference Oracle queries for 2-d orthogonal range counting require determining 
if any rectangle represented by a point in a 4-dimensional simplex has some point of S on its 
boundary. This problem can be decomposed into O(l) simplex range counting queries in M 4 
in a manner similar to that used for the point location problem. Thus, an Interference Oracle 
for this problem can be implemented in 0(n 3//4+<E ) time after 0(n log n) preprocessing. 

Theorem 15. There exists a distribution- sensitive data structure for the 2-d orthogonal 
range counting problem that uses O(nlogn) preprocessing time and 0(n log n) space. 



14 



Range trees satisfy the constraints of the comparison tree model of computation, 
which is considerably weaker than the linear decision tree model. Dujmovic et al. |23j 
give a distribution-sensitive 2-d orthogonal range counting data structure that works in 
the comparison tree model. However, their technique can only answer 2-sided queries, i.e., 
queries of the form [(71,00) x [(72,00). 

Filter trees can be made to work in the comparison tree model and handle full (4- 
sided) 2-d orthogonal range counting queries. The key modification required is the use of k-d 
trees in Theorem [4] rather than Matousek's partition trees. (This method, in 2 dimensions, 
is essentially how Dujmovic et al. obtain their results.) 

In the comparison tree model, an Interference Oracle query is a (f-dimensional box, 
rather than a simplex. In the special case of 2-d orthogonal range counting, Interference 
Oracle queries can be reduced to a constant number of rectangular range counting queries 
among the input set S. Therefore, Interference Oracle queries can, in this case, be answered 
in O(logn) time after 0(n log n) preprocessing using range trees. 

Theorem 16. There exists a distribution-sensitive data structure in the comparison tree 
model for the 2-d orthogonal range counting problem that uses 0(n log n) preprocessing time 
and space. 



6 Summary and Conclusions 

We have presented a data structure — the odds-on tree — that can be added on to any data 
structure that answers queries drawn from some distribution D over R rf . If the underlying 
data structure has query time O(logn), then the combined data structure will have optimal 
expected query time in the linear decision tree model. A variant of the odds-on tree based 
on k-d trees provides a similar result in the comparison tree model. Section [5] contains a 
smattering of applications of the odds-on tree. Many more are possible. 

Note that Corollary [I] applies to all of the problems in Theorems [6 16 to yield, 



for any constant p, data structures that are distribution-sensitive in the degree p algebraic 
decision tree model. However, when applying Corollary [TJ the Interference Oracle becomes 
considerably more complicated, so that the fast preprocessing times in Theorems [6j [7j [TUj 
and 15 increase to 0(n 1+e ). However, the real power of Corollary [l] is its applications to 



problems that cannot be solved by finite linear decision trees. Examples of such problems 
include fixed-radius circular ray shooting in polygons [Hj, point-location in 2-dimensional 
power diagrams [9], point-location in planar subdivisions whose edges are defined by alge- 
braic curves (such as arrangements of circles), and many others. 

We believe that the odds-on tree may actually be practical in some settings. The 
comparison-tree version of the odds-on tree is based on k-d trees, which are simple and 
widely used in practice. At worst, a comparison-based odds-on tree will perform O(elogn) 
extra comparisons before falling back to the backup data structure. 

In low dimensions, simpler alternatives to Matousek's Partition Theorem (Theo- 
rem [2]) are available and may be more practical. For example, in 2-d one can, 0(n) time, 
find two lines that partition any m point set into four point sets each of size at most [m/4] 
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and such that no line intersects more than the 3 of the resulting sets [35] • This result is 
strong enough that it can be used in place of Theorem [2] to prove all our results (for 2-d 
problems) and yields an odds-on tree that only requires 2 point-line comparisons at each 
node. 

Because the odds-on tree is so small (of size 0(n e )) it may be useful to speed up 
searching in environments where memory is constrained. For example, it can be applied 
to the succinct point location data structures of Bose et al. [11] to obtain a distribution- 
sensitive and succinct data structure for point location. In external-memory settings, an 
odds-on tree may also be useful as a filter that is sufficiently small to fit into internal memory 
while the backup structure lives in (larger but slower) external memory. 
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